PPI-IRO: a two-stage method for protein-protein interaction extraction based on interaction relation ontology

Authors

  • Chuanxi Li
  • Peng Chen
  • Rujing Wang
  • Xiu-Jie Wang
  • Yaru Su
  • Jinyan Li
Abstract

Mining protein-protein interactions (PPIs) from the fast-growing biomedical literature has proven to be an effective approach for identifying biological regulatory networks. This paper presents a novel method based on an Interaction Relation Ontology (IRO), which specifies and organises the words that describe the various interaction relationships between proteins. The method extracts PPIs in two stages. First, IRO is applied in a binary classifier to determine whether a sentence contains a relation. Then, IRO guides PPI extraction over the sentence's dependency parse tree. Comprehensive quantitative evaluations and detailed analyses demonstrate the effectiveness of IRO on relation sentence classification and PPI extraction. The PPI extraction method yielded a recall of around 80% and 90% and an F1 of around 54% and 66% on the AIMed and BioInfer corpora, respectively, which is superior to most existing extraction methods.
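
The abstract reports recall and F1 but not precision. Since F1 is the harmonic mean of precision and recall, F1 = 2PR / (P + R), the precision implied by the reported figures can be recovered by solving for P. The sketch below does that arithmetic; the function name `implied_precision` is hypothetical, and because the abstract gives only approximate ("around") values, the results are rough estimates and not figures reported by the authors.

```python
def implied_precision(recall: float, f1: float) -> float:
    """Solve F1 = 2PR / (P + R) for P, giving P = F1 * R / (2R - F1)."""
    denom = 2 * recall - f1
    if denom <= 0:
        # F1 can never reach 2 * recall, so such inputs are inconsistent.
        raise ValueError("F1 must be smaller than 2 * recall")
    return f1 * recall / denom


# Approximate (recall, F1) pairs as stated in the abstract.
reported = {"AIMed": (0.80, 0.54), "BioInfer": (0.90, 0.66)}

for corpus, (recall, f1) in reported.items():
    p = implied_precision(recall, f1)
    print(f"{corpus}: implied precision is roughly {p:.2f}")
```

Under these assumptions the implied precision comes out at roughly 41% on AIMed and 52% on BioInfer, which suggests the method favours recall over precision.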

Similar resources

Ethanol and Cancer Induce Similar Changes on Protein Expression Pattern of Human Fibroblast Cell

Abstract: Ethanol is widely consumed around the world. Many studies have confirmed adverse effects of this compound on human health. In addition, recent studies showed significant alterations in both the cellular population and the protein profile of the human foreskin fibroblast cell line (HFFF2) at a specific dosage of ethanol. Here, the role and interaction of some proteins (characterized by sig...

Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks

Background: Predicting protein localization is among the most important problems in bioinformatics; it is used to predict the location of proteins in cells and in organelles such as mitochondria. In this study, several machine learning algorithms are applied to predict intracellular protein locations. These algorithms use the features extracted from pro...

Prediction of Coffee Effects in Rats with Healthy and NAFLD Conditions Based on Protein-Protein Interaction Network Analysis

Background and objectives: Non-alcoholic fatty liver disease (NAFLD) is a common liver condition. Coffee consumption, meanwhile, has shown promise for gastrointestinal diseases. The aim of the present study was to detect the most valuable biomarkers of decaffeinated coffee treatment in healthy and NAFLD conditions. Methods: ...

On the efficacy of per-relation basis performance evaluation for PPI extraction and a high-precision rule-based approach

BACKGROUND: Most previous Protein-Protein Interaction (PPI) studies evaluated their algorithms' performance based on "per-instance" precision and recall, in which the instances of an interaction relation were evaluated independently. However, we argue that this standard evaluation method should be revisited. In a large corpus, the same relation can be described in various forms and, in...

Journal title:
  • International journal of data mining and bioinformatics

Volume 10, Issue 1

Pages -

Publication date: 2014